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The problem of Information Retrieval is, given a set of documents D and a 
query q, providing an algorithm for retrieving all documents in D relevant to q. 
However, retrieval should depend and be updated whenever the user is able to 
provide as an input a preferred set of relevant documents; this process is known 
as relevance feedback. Recent work in IR has been paying great attention to 
models which employ a logical approach; the advantage being that one can have 
a simple computable characterization of retrieval on the basis of a pure logical 
analysis of retrieval. Most of the logical models make use of probabilities or 
similar belief functions in order to introduce the inductive component whereby 
uncertainty is treated. Their general paradigm is the following: find the na- 
ture of conditional d — > q and then define a probability on the top of it. We 
just reverse this point of view; first use the numerical information, frequencies 
or probabilities, then define your own logical consequence. More generally, we 
claim that retrieval is a form of deduction. We introduce a simple but powerful 
logical framework of relevance feedback, derived from the well founded area of 
nonmonotonic logic. This description can help us evaluate, describe and com- 
pare from a theoretical point of view previous approaches based on conditionals 
or probabilities. 

The first difficulty one encounters towards a logical approach to Information 
Retrieval is how one should view the algorithm which returns a set of relevant 
documents. Our proposal is to see such an algorithm as a proof. However, this 
proof is not classical because, as it is widely recognized, q — > d, where q is a query 
and d a document relevant to q, is not conveyed by the material implication 



(see |vR86|,[vR89|). Also, as the set of data which this proof is based is subject 
to updating, nonmotonicity arises. Since there is no need of nesting implications, 
implication and deduction can be identified and therefore our starting point is 
to axiomatize a deduction ghd. The consequence relation K will reflect the 
properties that relevance satisfies. A similar approach where nonmonotonic 
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Tabic 1: Rules for Nonmonotonic Inference 



consequence relations are used to axiomatize a notion of aboutness appears in 
[ BH95 ]. We shall then show how l~ changes with relevance feedback. 

A set of rules for the relevance relation l~ appears in Table ^ T his set o f 
rules apart from Rational Monotonicity comprises the system P (see [ KLM9C| ) 
of preferential inference. Preferential inference along with the rule of Rational 



Monotonicity comprises the system R of rational inference (see [LM92 ). We 



propose preferential and rational inference as the simplest and strongest systems 
axiomatizing the notion of relevance. 

However, it is well known that designing a nonmonotonic system is far from 
trivial. Moreover, nonmonotonicity does not correspond directly to statistical 
data as statistical operations are in general continuous or monotonic. A solution 
to this problem is given by a recent characterization of the systems of preferen- 
t ial and r ational inference through systems of monotonic consequence relations 
(| Geo96b |) called priority relations. A presentation of priority relations follows. 
Our language of terms will contain only conjunction (A). The reason we do not 
employ a notion of negation is that there is no notion of negation which reflects 
our intuition in IR. The standard way to look at negation is ruled by the closed 
world assumption: if a term does not occur in a document then its negation 
occurs. However this makes documents possible worlds and deduction would 
turn out classical in our model. Hence negation will be replaced by another 
construction (see below). Documents will be identified with the conjunction of 
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all terms that occur in them. So, if {tj.,. .. , t n } are all documents occurring in 
a document d, then d = t\ A . . . A t n . Our model is based on a recent character- 
ization of nonmonotonicity by the second author through a family of relations 
among formulas called priority relations. A priority relation satisfies 

1. t\ k t\ (Reflexivity) 

2. ti\=t 2 and t 2 k t 3 implies t\ k t 3 (Monotonicity) 

3. h\=t2 and t 2 \=ti implies t 3 k t\ iff £ 3 k t 2 , (Logical Equivalence) 
where |= is classical. If a priority relation satisfies, in addition, 

4. t 3 k t\ and t 3 k £2 implies i 3 k t\ A t 2 , (Right Conjunction) 

5. t\ k t 2 and t 2 k i 3 implies t\ k i 3 , (Transitivity) 

it will be called preferential ordering. If a preferential ordering satisfies, in ad- 
dition, 

6. t\ k t 2 or ^2 k t\ (Connectivity) 
it is called rational ordering. 

We now have the following theorem ( jGeo96a ],|Geo|): Preferential and Ra- 
tional inference relations are generated by preferential and rational orderings, 
respectively, through a maxiconsistent inference scheme. Moreover, this corre- 
spondence is bijective. This is a powerful machinery for handling nonmonotonic 
deductions. Once we build a preference (preferential or rational) ordering, we 
are able to define and compute the associated nonmonotonic consequence rela- 
tion. Turning conjunction to union of sets of terms the defining conditions of 
priority relations become: 

1. UKU (Reflexivity) 

2. [/CI/ and U k W implies VHW (Monotonicity) 

3. U k V and U k W implies U k V U W, (Right Union) 

4. U k V and V k W implies U k W, (Transitivity) 

5. U k V or V k U (Connectivity) 

One can think of these monotonic consequence relations as aboutness re- 
lations. That aboutness sh ould be represented as a monotonic consequence 
relation appears already in (| HD95 |) along with a mix of nonmonotonic rules. 
Moreover, the above duality theorem translates to the fact that relevance and 
aboutness are really dual notions. We shall now show how one can easily gener- 
ate a preferential ordering from frequency information. We show how the user's 
relevance defines a nonmonotonic consequence relation which is represented by 
a preferential ordering. In IR, a way to define the informative content of terms is 
given by the frequencies in the document collection (as in the Robertson Sparck 
Jones formula): this is enough for us to carry on deductions! A way of con- 
structing a preferential ordering among terms is to divide the set of documents 
that the user distinguishes into two subsets, the positive set (D + ) and negative 
set (D~). Then the set of positive and negative examples will use frequencies 
to construct two rational orderings. Then these orderings will be combined in 
a preferential ordering which reflects our intuitions for relevance. Then a doc- 
ument d = t'i A . . . A t' m will be relevant to query q = tx A . . . A t n just in case 
t\ A . . . A t n k t\ A . . . A t' m . The construction is the following. Denote with 
the set of positive relevant documents where t occurs, similarly for the nega- 
tive case. The frequency of t relative to D + is \D^\. Now define the following 
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orderings 
and 



<ik p i 2 iff \D+\<\D+\ 



t 2 K„ti iff \D^\<\D^\. 
It can be shown that k p and k „ are rational orderings. Now let 

ti k t 2 iff ti k p t 2 and ti k n t 2 . 

The consequence relation k is a preferential ordering. It is now clear that if 
we perform an update of the sets of positive and negative relevant documents 
with relevance feedback then k changes, too. In particular, this change is 
nonmonotonic. 

Consider the following example. The following matrix is the matrix of oc- 
currences between documents d\, d 2 , 0I3, d± and terms ti, t 2 , £3, £4. 
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We have ti k P {t3, t^} k p t 2 and ti k n {ti, £3} k n t 2 - Thus ti k £3 k t 2 and 
ti k t 2 . For example, t 3 A ti k c?i and £3 A £4 k c?3, that is the query £3 A £4 
returns e?i and dz as relevant documents, but not d 2 and e?4. This provides a 
model for computing the relevance of documents with respect a query on the 
basis of some evidence. 

Consider now a new document <i 5 = t\ A t 2 . We have that cZ 5 is not to- 
tally ranked among the rest of the documents: we can decide just d§ k d 2 and 
di k g?5 . Let U be the number of undecided documents, D+ be the set of posi- 
tive documents d such that d k cfe, D~ be the set of negative documents d such 
that d$ k d, .D+ be the set of positive documents d such that d$ k c?, and D~ 
be the set of negative documents d such that d k ^5. If we want to create such 
a linear ordering among documents we can use the the Robertson Spark Jones' 
formula and relate it to the number of incomparable documents. Then 

N-U, /D+xD- 
log 1 ' 



N 



can be used as a decision rule. In the above example, we get a negative value 
for d§ which is then chosen as irrelevant. 

A point whose importance should be stressed is that generation of a non- 
monotonic consequence relation (relevance) through priority relations depends 
on a notion of negation. Only in presence of negation we are able to define maxi- 
consistent inference (see [ Gco96b[ ). We shall now show how a recent approach 
to IR (|AvRU96|) using expected utility functions defines a natural notion of 
negation. Suppose that the user supplies two sets of documents representing 
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positive and negative examples. Through theses sets and the frequencies of 
terms appearing in those sets one can construct for every term a contingency 
matrix. Using now entropy H and Hintikka's content C, one can define a weight- 
ing function from the set of terms to the interval [—1,1]. Let w be the weighting 
function. Set w(^t) = —w(t) and w(t A t') = mm(w(t),w(t')). Let 

r(i] = { W[t) ifw W >0 
^ ' [0 otherwise 

The function r ranks terms according to their utility. Now the following relation 

h H t 2 iff r(ti) < r(t 2 ) 

is a rational ordering, and the inference 

t\ l~ t 2 iff h A t 3 h ^2, for some £3 ^ — 1*1 

is a rational nonmonotonic inference. The above relation can be readily extend 
to all boolean combinations of terms. However, we consider only conjunctions 
of positive terms. Then the rational inference can be effectively checked through 
the following equivalent definition (ti and t 2 are sets of terms) 

t\ l~ t 2 iff either t 2 C t\, 

or t2 — t\ \fi —iti . 

As an example consider three terms t\, t 2 and t 3 and their contingency 
matrices based on a set of 300 relevant documents and a set of 700 irrelevant 
documents: 
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The weighting function now gives the following 

w{t{) = -.802 r{ti) = 

w (t 2 ) = -.885 r{t 2 ) = 

w(t 3 ) = -845 r(i 3 ) = .845 

w(t 1 At 2 ) = -.885 r(*iA* 2 ) = 

We now have ti h- t 3 , since .845 ^ .802, but hAt 2 ^t 3 , since .845 < .885. 
A table of all possible derivations between terms appears below: 
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This logical framework is just an initial suggestion for future research, which 
shows that nonmonotonicity, induction and IR can have many common features 
which are worthwhile to explore. 
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